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The  objective  of  this  work  is  to  develop  a  robust  guidance  and  control  architecture  for 
autonomous  reusable  launch  vehicles  that  incorporates  elements  of  recent  advances  in  the 
areas  of  optimal  trajectory  generation  and  reconfigurable  control.  This  work  integrates 
three  separately  developed  methods  to  form  a  coherent  architecture  with  the  potential  to 
manage  control  effector  failures,  vehicle  structural/aerodynamic  degradation,  uncertainty, 
and  external  disturbances.  Outer-loop  guidance  commands  in  the  form  of  body-frame 
angular  rates  (roll,  pitch,  and  yaw)  are  generated  from  an  optimal  reference  trajectory  that 
is  computed  off-line  with  a  direct  pseudospectral  method  and  then  tracked  by  a 
reconfigurable  inner-loop  control  law.  The  appropriate  open-loop  state  histories  from  the 
psuedo-four-degree-of-freedom  reference  trajectory  are  converted  using  a  modified 
backstepping  approach  that  complements  the  inner-loop  control  law  in  a  six-degree-of- 
freedom  simulation.  The  inner-loop  control  law  is  capable  of  reacting  and  compensating  for 
off-nominal  conditions  by  employing  nonlinear  reconfigurable  control  allocation,  dynamic 
inversion,  and  model-following/anti-windup  prefilters.  The  results  show  that  the  inner-loop 
control  can  adequately  track  the  desired  optimal  guidance  commands;  thus,  confirming  the 
applicability  of  this  control  architecture  for  future  development  involving  on-line,  optimal 
trajectory  generation  and  high-fidelity  guidance  and  control  for  reentry  vehicles. 


I.  Introduction 

The  safety  of  manned  or  unmanned  space  vehicles  returning  to  earth  is  of  paramount  importance.  One  way  to 
improve  their  safety  is  by  creating  more  capable,  reliable  and  efficient  guidance,  navigation,  and  control  methods. 
Onboard,  real-time  optimal  trajectory  generation,  planning,  adaptation,  reconfiguration,  and  retargeting  are  the 
methods  currently  being  pursued  to  achieve  the  autonomous  operations  needed  to  facilitate  the  accomplishment  of 
these  objectives. 

Previous  work  by  Shaffer  [l]-[2]  has  integrated  trajectory  reshaping  and  retargeting  with  the  reconfigurable 
control  work  of  Oppenheimer  et  al.  [3] -[5],  to  demonstrate  relatively  fast  computations  of  optimal  trajectories  under 
trim  deficient  path-constraints.  This  work  used  interpolated  aerodynamic  data  that  incorporated  wing,  body  and  trim 
effects  from  a  given  vehicle  flight  condition  and  an  optimized  effector  displacement  vector  [2].  This  essentially 
decoupled  the  outer  loop  from  the  inner  loop.  Despite  efforts  involving  on-line  approaches  [6]-[ll],  off-line 
reference  trajectories  are  still  used  for  tracking  applications.  Since  the  early  days  of  space  shuttle  entry  guidance, 
designers  have  been  employing  various  reference  trajectory  tracking  schemes  [12].  Various  research  has  addressed 
the  reentry  problem  by  using  an  optimal  trajectory  generator  to  solve  for  a  reference  input  trajectory  off-line,  then 
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use  other  inner-loop  control  means  to  track  the  desired  trajectory  [13]-[15].  In  some  cases,  off-line  reference 
trajectories  are  combined  with  on-line  trajectory  generators  such  as  the  “Optimum-Path-To-Go”  methodology 
developed  by  Schierman  et  ah,  as  previously  cited. 

In  a  similar  fashion,  this  paper  combines  some  of  the  approaches  mentioned  to  demonstrate  that  a  previously 
developed  inner-loop  control  design,  based  on  dynamic  inversion  (DI),  can  successfully  track  variable  body-axis  roll, 
pitch,  and  yaw  commands  generated  from  an  off-line,  optimal  reference  trajectory.  Note  that  although  the  reference 
trajectory  is  generated  off-line  for  this  paper,  concurrent  work,  as  demonstrated  in  Ref.  [16],  shows  that  rapid 
trajectory  generation  is  capable  of  providing  optimal  nonlinear  feedback;  hence,  making  this  work  viable  for  on-line 
applications. 

To  solve  the  optimal  control  problem,  a  spectral  algorithm  [17]-[19]  known  as  the  Legendre  Pseudospectral 
Method  is  employed  by  use  of  a  MATLAB-based  software  package  called  DIDO  [20].  This  direct  method 
discretizes  the  problem  and  approximates  the  states,  co-states  and  control  variables  by  use  of  Lagrange  interpolating 
polynomials  where  the  unknown  coefficient  values  coincide  with  the  Legendre-Gauss-Lobatto  (LGL)  node  points. 
After  this  approximation  step,  a  nonlinear  program  (NLP)  solver  (SNOPT)  solves  a  sequence  of  finite-dimensional 
optimization  problems  that  capture  the  nonlinearites  of  the  system  in  the  form  of  an  optimal  control  problem.  For  an 
extensive  description  of  this  method  and  its  use  for  reentry  applications,  see  references  [l]-[2],  [16],  and  [17] -[22]. 

The  overall  goal  of  this  work  is  to  extend  recent  developments  in  the  areas  of  optimal  trajectory  generation  and 
reconfigurable  control  by  forming  a  robust  guidance  and  control  (G&C)  architecture  that  combines  three  separately 
developed  methods:  (1)  optimal  trajectory  generation,  (2)  guidance  command  generation  based  on  proportional- 
integral  (PI)-loop  closure  backstepping,  and  (3)  reconfigurable  inner-loop  control.  The  following  list  identifies  the 
specific  objectives  for  this  study. 

1 .  To  see  how  well  the  inner-loop  controller  tracks  the  optimal  command  histories  and  remedy  any  problems 

2.  To  verify  what  the  body-frame  angular  rates  (P,Q,R)  should  be  (steady-state  trim  values?)  since  previously 
assumed  constant  in  other  studies 

3.  To  provide  initial  guesses  for  other  6-DOF  optimal  reentry  trajectory  studies  using  DIDO 

4.  To  provide  a  baseline  for  comparing  6-DOF  simulation  control  deflection  histories  to  optimal  deflections 
computed  by  a  6-DOF  DIDO  model 

5.  To  provide  a  baseline  for  planned  studies  involving  a  single,  “integrated”  optimal  G&C  architecture 

II.  Guidance  and  Control  Design  Architecture 

The  overall  G&C  design  architecture  for  this  work  is  presented  in  Fig.  1.  The  architecture  consists  of 


state  Feedback 


a,  (p  Feedback 

Fig.  1  Conceptual  G&C  Design  Architecture 

a  two-loop  structure:  (1)  an  outer  loop  that  compares  the  actual  angle-of-attack  and  the  bank  angle  measurements 
with  those  provided  from  the  optimal  reference  trajectory  outputs  and  (2)  an  inner-loop  that  is  designed  to  track  the 
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optimal  body-rates  (P,Q,R)  generated  from  the  guidance  command  generator.  For  this  work,  a  full,  six-degree-of- 
freedom  (6-DOF)  model  of  a  reusable  launch  vehicle  (RLV)  was  employed  for  the  inner-loop  tracking  simulation 
whereas  the  reference  trajectory  was  generated  using  a  psuedo-four-degree-of-freedom  (4-DOF)  model.  Details  of 
the  outer  and  inner-loops  are  presented  in  sections  III  and  IV,  respectively. 

III.  Outer-Loop  Guidance  Command  Generation 

This  section  provides  some  details  of  how  the  off-line  optimal  trajectory  is  generated  with  on-line  viability  and 
then  converted  into  useful  guidance  commands. 

A.  Off-Line,  Optimal  Trajectory  Generation 

First,  an  off-line  reference  trajectory  is  generated  by  posing  the  reentry  problem  as  a  standard  optimal  control 
problem  (OCP)  and  solving  for  the  extremal  controls  using  DIDO.  For  this  paper,  a  reduced-order  model  is 
adequate  to  demonstrate  the  feasibility  of  the  approach.  Thus,  the  full  6-DOF  equations  of  motion  (EoM)  are 
simplified  and  decoupled.  The  model  used  here  assumes  a  point-mass-model  over  a  flat,  non-rotating  earth  such  that 
the  positional  and  translational  equations  of  motion  in  a  Cartesian  “local  horizontal”  coordinate  system  become 

X  =  Fcos/cosi// 
y  =  V  cos;rsin^ 
z  =  F  sin/ 

F  = - gsm/ 

m 

L  geos/ 

/  = - 

mF  F 

L  sin  d) 

y/  = - ^ 

mF  cos/ 

where  x  (down-range),  y  (cross-range),  and  z  (altitude)  are  the  vehicle’s  position  with  respect  to  the  fixed-earth 
reference  frame,  V  is  the  velocity  magnitude  (i.e.  total  equivalent  airspeed),  /  is  the  flight-path-angle  (FPA),  y/  is 

the  heading  angle  (HA),  a  is  the  angle-of-attack  (AoA),  (j)  is  the  bank  angle  (BA),  and  m  is  the  vehicle’s 
approximate  mass  during  reentry  modeled  as  2455  slugs  (-79,000  lbs).  In  Eq.  (I),  the  lift  and  drag  forces  are 
represented  as  L  and  D,  respectively,  and  are  given  by 


L^^p{z)V^C,{a,M)S,,, 

(2) 

(3) 

where  =  1600  tf  is  the  aerodynamic  reference  area.  As  noted  in  Eqs.  (2)  and  (3),  the  aerodynamic  coefficients 

are  assumed  to  be  functions  of  state  variables  only: 

C„C,=f(a,M) 

(4) 

and  the  Mach  number  and  atmospheric  density  are  functions  of  altitude: 

M=M{z) 

(5) 

II 

(6) 
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The  lift  and  drag  coefficients  are  computed  using  table  lookup  data  that  incorporates  wing,  body,  and  trim  effects. 
Likewise,  the  Mach  and  density  are  computed  using  table  lookup  data  based  on  a  standard  1976  atmospheric  model. 
See  Ref  [2]  for  more  details  on  the  use  of  table  lookup  data  for  a  similar  model. 

The  optimized  controls  for  this  problem  are  essentially  the  standard  AoA  and  BA  modulation,  but  to  help 
compensate  for  command  delays  (i.e.  lags)  and  to  add  more  realism/fidelity  to  the  problem,  as  explained  in  Ref  [1] 
and  [2],  the  rates  of  these  angles  are  used  as  “virtual”  controls.  This  has  the  benefit  of  allowing  rate  limits  on  AoA 
and  BA  which  prevents  unrealistic  responses.  Therefore,  the  control  vector  is  defined  as 

u=\u^, 

and  the  state  vector  is 

x  =  [xyzV  Y  y/  a  (pf 


(7) 

(8) 


As  with  any  dynamical  optimization  problem,  the  cost  function  (objective  function),  governing  EoM,  path 
constraints,  boundary  limits  on  initial/fmal  conditions,  and  any  constraints  (on  states  and/or  controls)  must  be 
defined.  As  such,  the  general  OCP  for  trajectory  generation  is  fully  posed  in  the  following  manner: 

min^  J {x{t) ,u{t) ,T^,T =  e{^x{t^) ,x{t ,T^,T +  ^ F [x{t) ,u{t) ,T)dT 

s„.ec..o 

h  -  h{x,u,T)  < 

Xi<x{t)<x^ 

u^<u{t)<u^ 

The  goal  is  to  find  a  state-control  function  pair,  |^  (*)  ?  (*)!  ,  or  sometimes  time,  T  ,  that  minimizes  the 

performance  index  represented  by  the  Bolza  form,  -^  (*)  ,  consisting  of  either  a  Mayer  term,  ,3^  Lagrange 
term,  ,  or  both  as  stated  above. 

Summarizing  the  previous  reentry  equations,  the  specific  optimal  control  formulation  for  this  RLV  problem  is 
stated  as  follows:  Given  an  initial  position  vector  ([xq,  Yo,  Zo  ,  velocity  magnitude  (^o  )  ’  heading 

angle  (^q )  ’  A^A  (^OCq)  ,  and  BA  (^q )  ’  ^he  control  history  that  maximizes  the  horizontal 

downrange  )  or  cross-range  (t/  )  under  various  constraints.  In  the  context  of  Eq.  (9)  above  and  for  the 
analysis  presented  in  this  work,  the  cost  functions  are  simply: 

Min  {  /[•]  =  -  -  or  -  Min  {  /[•]  =  -  y^}  (10) 

subject  to  the  dynamic  constraints  given  by  Eq.  (1),  the  initial  and  final  event  conditions  specified  as: 
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(11) 


and  the  state  (14),  path  (15),  and  control  (16)  inequality  constraints,  respectively,  specified  as 
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(12) 

(13) 


(14) 


(15) 


s  s 


where  the  path  constraint  terms  represent  the  normal  acceleration  =  |Z  COS  a  +  D  sin  | ,  the  dynamic  pressure 

^  —  p(z)F^  ,  and  the  heating  rate  Q  =  k^J p{r)  with  constant  k  based  on  the  vehicle’s  heat  shield 

properties. 


B.  On-Line,  Optimal  Trajectory  Generation 

Although  this  work  computes  the  optimal  reference  trajectory  off-line  and  then  extracts  the  appropriate  signals  to 
use  in  the  guidance  command  generation  algorithm,  preliminary  studies  conducted  concurrently  with  this  work  have 
indicated  that  the  same  model  using  approximated  aerodynamic  data  can  solve  the  problem  approximately  85  % 
faster  than  using  the  table  look-up  data.  For  example,  recent  work  used  a  second-order  polynomial  approximation 
for  lift  and  drag  coefficients  and  a  standard  two-parameter  exponential  atmospheric  model  that  resulted  in  the 
successful  implementation  of  a  nonlinear  sampled-data  feedback  method  with  an  on-line,  trajectory  re-optimization 
scheme  that  could  generate  optimal  trajectories  99.75  %  faster  than  the  same  model  using  the  table  look-up  data  [16]. 
Further  work  is  required  to  improve  the  accuracy  of  the  approximations,  but  initial  results  look  promising  for  on-line 
reentry  applications. 
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C.  Command  Generation  via  “Backstepping  Architecture”  (PI  &  DI) 

From  the  optimal  trajectory,  the  a  and  ^  commands  are  converted  into  the  body-axis  angular  velocities  (P,Q,R)  to 
provide  the  desired  inner-loop  commands.  The  generation  of  these  commands  is  based  on  what  Schierman  defines 
as  a  “backstepping”  approach  whereby  the  “pseudo-commands”  at  each  loop-closure,  using  Pl-control  and  DI, 
drives  the  next  inner-most  loop  [23].  Common  loop  closures  may  consist  of  an  outer-most  altitude  loop,  a  FPA  loop, 
and  an  enclosed  inner-most  AoA  loop. 

For  this  experiment,  the  3 -DOF  DIDO  trajectory  provides  the  a  and  ^  commands  that  are  then  used  to  generate 
the  body-rate  commands  (Pcmd,  Qcmd,  Rcmd)-  For  example,  assuming  only  longitudinal  motion,  the  appropriate  pitch 
rate  command  is  generated  based  on  the  following  calculations.  Ignoring  lateral-directional  influences  (for  now), 

the  wind-axis  relation  a  =  9  —  /  and  the  simplified  pitch  rate  Q  =  0  provide  the  governing  EoM  such  that 


a  =  -f  +  Q 

Also,  the  governing  EoM  for  the  FPA  is 

.  _  L  gcos(0) 
mV  V 


Substituting  Eq.  (17)  into  Eq.  (18),  the  resulting  pitch-rate  command  is  derived  as 


Qcmd 


■■  a 


des 


L 

H - 

mV 


gcos(x) 

V 


(17) 


(18) 


(19) 


To  improve  a  tracking,  the  desired  a  dynamics  are  generated  using  a  proportional  feedback  controller 

(^des=KMcmd-(^)  (20) 

where  is  the  optimal  a  command  from  the  3 -DOF  DIDO  trajectory.  Figure  2  shows  a  block  diagram  that 

represents  the  computation  of  the  optimal  guidance  commands. 


Fig.  2  P  and  Q  Command  Generation 
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Note  that  for  the  Qcmd  generation  in  Fig.  2,  an  extra  lateral  term  is  added  to  account  for  lateral  effects  as  explained  in 
the  results  section.  Also,  not  shown  in  Fig.  2  ,  is  the  generation  of  the  yaw  command  (Rcmd)  that  is  computed 
according  to  Ref  [27  ]  assuming  coordinated  turns  and  is  given  by 


Rcmd^Pcmd  tana  + 


gsin^ 


u 


(21) 


It  is  also  important  to  note  that  throughout  this  paper  the  notation  (f)  is  used  for  both  BA  and  roll  angle  since  they 
are  assumed  equal  for  this  study. 


IV.  Reconfigurable  Inner-Loop  Control 

This  work  implemented  a  6-DOF  simulation  containing  a  reconfigurable  inner-loop  control  algorithm  that  uses 
DI,  control  allocation,  and  model  following  prefilters  with  integrator  anti-windup  and  reference  model  bandwidth 
attenuation.  Note  that  although  not  discussed  here  for  purpose  of  brevity,  the  simulation  model  employs  Etkin’s 
standard  6-DOF  EoM  [28]. 

A.  Dynamic  Inversion  and  Control  Allocation 

The  inner-loop  control  system  uses  DI  in  order  to  track  the  desired  body-frame  angular 
velocities  )  .  The  rotational  dynamics  for  this  type  of  vehicle  can  be  written  as: 

/<y  =  Gg (22) 


where  I  is  the  moment-of-inertia  tensor,  =  and  is  a  vector  consisting  of  the  total  moments  acting 

on  the  vehicle  with  contributions  from  the  wing-body-propulsion  system  (BAE)  and  the  control  effectors  {S)  such 
that 


'  l' 

'  L~ 

Gb  =  Gbae  +  Gg  (P,  S)  = 

M 

+ 

M 

N 

BAE 

N 

(23) 


where  L,  M,  and  N  are  the  rolling,  pitching,  and  yawing  moments,  respectively;  the  vector  P  denotes  a  measurable 
or  estimable  quantity  that  can  influence  body  rates  and  can  contain  variables  such  as  AoA,  sideslip,  Mach  number, 

and  mass  properties;  and  S  is  a  vector  of  control  surface  deflections  given  by  c52 , ^5^  ]  .  To  design  the 

DI  control  law,  equations  (22)  and  (23)  are  put  into  a  more  standard  form  by  defining 
f  P)  =  G^^^  (cO,  P)  —  COX  I  CO  such  that 

I(b  =  f{co,P)  +  G,{P,5)  (24) 


The  objective  is  to  find  a  control  law  that  provides  direct  control  over  Q)  such  that  d>  =  ;  therefore,  the  DI 

control  law  must  satisfy 


I(b,^^-f{(o,P)  =  G,{P,d) 

But,  since  this  problem  has  more  control  effectors  than  control  variables,  a  control  allocation  algorithm  is  required 
for  a  unique  solution. 

A  linear  programming  based  control  allocator,  which  obeys  rate  and  position  limits,  is  used  in  this  work.  To 
implement  this  type  of  allocator,  the  control  dependent  portion  of  Eq.  (25)  must  be  linear  in  the  controls.  Hence,  Eq. 
(25)  is  rewritten  as 
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Imes-f{co,P)  =  G,{P,d)  =  G,{P)d  (26) 

In  order  to  account  for  nonlinearities  in  the  moment-deflection  relationship,  a  slope-intercept  term  is  added  to  Eq. 
(26)  such  that 


/  (Odes  -fia),P)  =  GgiP)d  +  £(P,  S) 

Then,  the  final  inverse  control  law  becomes 

-  f((0, P)  - I-^e(P, S)  =  I-%(P)S 


(27) 


(28) 


For  more  details  on  this  DI  method  and  the  control  allocation  algorithm  see  references  [3]-[5]  and  [23]-[25].  A 
block  diagram  representation  of  the  dynamic  inversion  control  law  is  shown  in  Fig.  3. 


Dynamic  Inversion 


Fig.  3  Dynamic  Inversion  with  Control  Allocation 
B.  Model  Following  Prefilters 

To  provide  robustness  to  modeling  errors,  inversion  errors,  and  to  help  shape  the  closed-loop  response,  prefilters 
were  added  to  the  DI  control  system  as  shown  in  Fig.  4.  Previous  work  involving  the  inner-loop  control  designs  for 
the  X-40A  tested  two  different  prefilter  structures:  implicit  [4]  and  explicit  [23].  For  this  work,  an  implicit  model¬ 
following  scheme  was  selected  based  on  its  simplicity  in  regards  to  having  fewer  gains  that  would  ultimately  need 
tuning.  Also,  it  was  desired  that  the  closed-inner-loop  control  system  from  to  co  has  the  characteristics  of  a 
first-order  response.  The  implicit  structure  presented  in  Fig.  5  provides  this  behavior  and  helps  compensate  for 
imperfections  in  the  DI  control  law.  A  closer  look  at  this  structure  with  some  straight-forward  block  diagram 
algebra  reveals  that  a  stable  pole/zero  cancellation  occurs.  The  resulting  transfer  function  displays  the  desired 
closed-inner-loop  response: 


^  =  2  (29) 

^des  s  + 

2 

Note  that  Fig.  5  only  displays  a  single  loop;  however,  the  actual  model  implemented  contained  a  loop  for  each  of  the 
body-axis  angular  rates. 
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Dynamic  Inversion 


Fig.  4  Dynamic  Inversion  with  Control  Allocation  and  Prefilters 


C.  Integrator  Anti-Windup  and  Reference  Model  Bandwidth  Attenuation 

Axis  saturation  occurs  when  all  control  power  is  used  on  one  or  more  axes.  For  flight  control  applications,  when 
a  control  surface  moves  at  its  rate  limit  or  resides  on  a  position  limit,  then  control  effector  saturation  occurs.  This  is 
a  necessary,  but  not  sufficient,  situation  for  axis  saturation.  With  axis  saturation,  no  additional  control  power  is 
available  when  requested  by  the  control  system  and  this  should  be  taken  into  account  by  the  control  law.  Analysis 
of  the  control  allocation  inputs  (ddes)  and  outputs  (B5)  can  indicate  axis  saturation.  To  prevent  canceling  tracking 
errors  caused  by  the  axis  saturation,  the  following  integrator  anti-windup  law  is  added  to  reduce  the  magnitude  of 
input  signal  to  the  integrator. 

where  is  the  anti-windup  gain,  is  the  desired  accelerations  from  the  control  effectors,  and  is  what  the 
control  allocator  thinks  is  being  produced  by  the  effectors.  If  no  saturation  occurs,  then  =  0  and  the 

control  law  operates  normally;  otherwise,  at  least  one  axis  is  saturated  and  the  state  of  the  prefilter  integrator  is 
reduced  by  the  anti-windup  signal.  The  anti-windup  scheme  is  implemented  as  depicted  in  Fig.  5. 


Fig.  5  Implicit  Model  Following  Prefilter  Integrator  Anti-Windup  Compensation 
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For  more  details  on  the  anti-windup  integrator  and  its  use  see  Ref.  [26]  and  [23]. 


V.  Results  and  Discussions 

The  primary  performance  concerns  for  this  work  were  tracking  error  and  control  saturation.  Of  course  the  overall 
index  of  performance  is  that  the  cost  functions  for  both  the  outer-loop  guidance  and  inner-loop  control  agree  to 
within  an  acceptable  tolerance.  As  long  as  the  desired  trajectory  and  cost  were  accomplished,  the  tracking 
performance  was  only  graphically  confirmed.  For  this  paper,  only  the  max  downrange  and  max  cross-range  results 
are  presented. 

A.  Max  Downrange  (DR)  Case 

For  the  case  of  maximizing  the  vehicle’s  downrange  distance,  the  3 -DOF  optimal  trajectory  is  generated  off-line 
and  a  and  (|)  histories  are  extracted  for  use  as  the  desired  guidance  commands.  These  command  profiles  are  shown 
in  Fig.  6. 


Time  (sec) 

Fig.  6  Max  DR  Commands  from  3-DOF  Optimal  Reference  Trajectory 

Initial  comparison  of  the  PQR-guidance  commands  with  the  actual  PQR-states  revealed  that  the  inner-loop 
controller  was  successfully  tracking  the  desired  values;  however,  when  comparing  the  actual  states  to  those  of  the 
reference  trajectory,  there  were  some  unacceptable  errors,  especially  for  a.  After  carefully  reviewing  the  data,  trial- 
and-error  gain  tuning  on  the  prefilter  and  anti-windup  gains,  it  was  determined  that  adding  an  integrator  in  the 
command  generation  block.  Fig.  2,  improves  the  reference  trajectory  tracking  as  shown  in  Fig.  7.  With  this 


Fig.  7  Comparison  of  AoA  with  and  without  Integrator  in  Command  Generation  Logic 
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addition  and  use  of  the  gains  in  Table  1,  the  max  DR  results  were  acceptable  with  an  average  difference  of  only  2  % 
between  the  actual  and  commanded/desired  values  as  seen  in  figures  9,  10,  and  11.  The  cost  for  the  optimal 
reference  trajectory  and  the  simulation  were  1,515,588  ft  and  1,515,852  ft,  respectively,  which  results  in  only  a 
0.017  %  error. 


Table  1  Tuned  Gains  used  in  Inner-Loop  Control  System 


Gain  Type 


Max  DR 


Max  CR 


Prefilter  BW,  P  (Kbp) 
Prefilter  BW,  Q  (K^q) 
Prefilter  BW,  R  (KbR) 
Proportional  DI  (Kp) 
Integral  DI  (Kj) 
Anti-Windup,  P  (Kaw,p) 
Anti- Windup,  Q  (Kaw,q) 
Anti-Windup,  R  (Kaw,r) 


5.0 

4.0 

5.0 

4.0 

5.0 

4.0 

0.8 

0.9 

0.5 

0.5 

0.2 

0.1 

0.2 

0.1 

0.2 

0.1 

2 

o'  1 
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Fig.  8  Max  DR  Linear  Velocities  (U,V,W) 
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Fig.  10  Max  DR  Euler  Angles  (Phi,  Theta,  Psi) 


12 


Time  (sec) 


Fig.  11  Max  DR  Airspeed  (V)  and  Wind-Relative  Angles  (Alpha,  Beta,  Gamma) 
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Fig.  12  Max  DR  Eleven  Control  Surface  Deflections  (Right  Eleven  Inboard,  etc...) 
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Fig.  14  Max  DR  Rudder  Control  Surface  Deflections 


Figures  12-14  show  that  all  of  the  control  surface  deflections  in  the  6-DOF  simulation  remain  within  their  respective 
limits  as  indicated  by  the  dash-dot  lines.  As  indicated  in  Fig.  13,  there  is  only  one  region  of  body-flap  control 
saturation  from  approximately  290-325  sec  that  may  be  related  to  an  initial  pitch  down  attitude  just  prior  to  what 
appears  to  be  a  steady-state  trimmed  condition  from  approximately  425-980  sec.  Figure  15  shows  that  the  anti¬ 
windup  values  remain  approximately  zero  (10'^^)  throughout  the  flight  simulation. 
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Fig.  15  Max  DR  Anti-Windup  Signals 


Fig.  16  Max  DR  Comparison  of  Reference  and  Tracking  Trajectory 


In  addition  to  checking  that  the  system  adequately  tracks  the  desired  guidance  commands,  Fig.  16  shows  that  these 
commands  result  in  an  acceptable  reentry  trajectory  when  comparing  the  6-DOF  simulation  and  the  3 -DOF 
reference  x,y,z-state  histories. 

B.  Max  Cross-Range  (CR)  Case 

For  the  case  of  maximizing  the  vehicle’s  cross-range  distance,  a  and  ^  histories  are  extracted  from  the  off-line 
optimal  trajectory  as  was  done  for  the  max  DR  case.  These  command  profiles  are  shown  in  Fig.  18. 
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Fig.  17  Max  CR  Commands  from  3-DOF  Optimal  Reference  Trajectory 


For  maneuvers  with  large  bank  angles,  such  as  the  max  CR  case,  there  is  a  loss  of  lift  that  must  be  countered  in 
order  to  maintain  altitude.  Typically,  the  elevator  is  used  to  provide  more  lift.  In  short,  the  longitudinal  and  lateral 
equations  of  motion  are  strongly  coupled  during  maneuvers  with  large  bank  angles.  For  example,  a  pilot  must 
maintain  adequate  backpressure  on  the  yoke  during  steep  turns  to  prevent  loss  of  altitude.  To  account  for  this  in  the 

pitch-command  generation,  the  following  extra  lift  term  [28]  was  added  to  provide  the  appropriate 

contribution  from  the  bank  angle: 


a 

L  =  mg  sec((zJ)  ^  ^sec((zJ) 


(31) 


Similarly,  the  simplifying  assumption  that  6  —  Q  used  for  the  backstepping  architecture  in  section  III.C  is  only 
valid  when  the  roll  angle  (^)  is  sufficiently  small.  For  the  maximum  CR  trajectory  where  the  roll  angle  may  be 
large,  it  is  better  to  use  the  relation  0  =  Q  COS  (p  —  R  sin  (j)  [28]  such  that  Eq.  (17)  becomes 

d  = -y  +  Q cos (!>- Rs\n(f>  (32) 

and  Eq.  (19)  becomes 


Qcmd 


L 

H - 

mV 


gcos(/) 

V 


SQC(/) 


(33) 


Of  course,  this  still  has  the  implied  assumption  that  a  =  0  —  y  which  in  only  valid  for  “wings-level”  flight,  but 
including  Eq.  (31)  helps  compensate  for  this. 

With  this  addition  and  use  of  the  gains  in  Table  1,  the  max  CR  results  were  acceptable  with  only  an  average 
difference  of  2.2  %  between  the  actual  and  commanded/desired  values  as  seen  in  figures  19,  20,  and  21.  The  cost 
for  the  optimal  reference  trajectory  and  the  simulation  were  664,862  ft  and  671,781ft,  respectively,  which  results  in 
only  a  1.04  %  error. 
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Fig.  18  Max  CR  Linear  Body-Relative  Velocities  (U,V,W) 
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Fig.  20  Max  CR  Euler  Angles  (Phi,  Theta,  Psi) 
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Fig.  21  Max  CR  Airspeed  (V)  and  Wind-Relative  Angles  (Alpha,  Beta,  Gamma) 
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Fig.  22  Max  CR  Elevon  Control  Surface  Deflections 


Fig.  23  Max  CR  Body  Flap  Control  Surface  Deflections 
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Fig.  24  Max  CR  Rudder  Control  Surface  Deflections 


Similar  to  the  max  DR  case,  Figures  22-24  show  that  all  of  the  control  surface  deflections  in  the  6-DOF  simulation 
remain  within  their  respective  limits  as  indicated  by  the  dash-dot  lines.  Again,  as  indicated  in  Fig.  23,  there  is  only 
one  region  of  body- flap  control  saturation  from  approximately  240-280  sec.  Also,  Fig.  25  shows  that  the  anti¬ 
windup  values  remain  approximately  zero  (10'^^)  throughout  the  flight  simulation. 


Time  (sec) 

Fig.  25  Max  CR  Anti-Windup  Signals 
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Fig.  26  Max  CR  Comparison  of  Reference  and  Tracking  Trajectory 

Figure  26  verifies  that  the  guidance  commands  result  in  an  acceptable  reentry  trajectory  when  comparing  the  6-DOF 
simulation  and  the  3 -DOF  reference  x,y,z-state  histories. 

VI.  Conclusions 

This  paper  presented  the  successful  integration  of  a  reconfigurable  inner-loop  control  law  consisting  of  DI, 
control  allocation,  model  reference  prefilters,  and  anti-windup  integrators  with  an  outer-loop,  optimal  guidance 
command  generator.  As  demonstrated,  the  inner-loop  control  law  was  capable  of  tracking  the  body-frame  angular 
rates  that  were  converted  from  the  wind-relative  a  and  ^  modulation  of  the  off-line  reference  trajectory.  Although 
the  optimal  trajectory  generation  was  done  off-line  for  this  work,  a  similar  model  has  already  been  demonstrated  to 
work  for  on-line  reentry  applications  using  the  same  direct  Legendre  pseudospectral  method  [16]. 

The  potential  drawback  of  the  presented  G&C  architecture  is  that  the  inner-loop  control  system  depends  on 
various  gains  that  require  off-line  tuning.  This  may  limit  the  system’s  use  for  on-board  autonomous  applications, 
especially  in  situations  involving  unplanned  maneuvers  and/or  flight  anomalies.  Even  with  the  added  robustness 
provided  by  the  prefilters,  anti-windup  mechanism,  and  the  reconfigurable  control,  additional  modifications  may 
still  be  needed  to  handle  unexpected  operational  conditions.  With  this  said,  concurrent  and  future  work  involves  the 
development  of  fully  autonomous  systems  that  can  account  for  any  unforeseen  circumstances,  including,  but  not 
limited  to  uncertain  aerodynamics,  significant  external  disturbances,  control  failures,  and  vehicle  damage. 
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